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For the errors in variables model X = U + V, Y = pf(U) + W, sufficient conditions are given for the L.S. 
limiting estimate of p to satisfy P^P < 1) = 1 or PlfVp > 1) = 1 as the sample size tends to infinity. 
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The problem of linear regression when both variables are subject to error is known to be difficult, see 
Madansky [l], 1 and Moran [2], In particular under general conditions there is no consistent estimator for p 
in the model (1.1), (1.2) below based upon only the first two moments of A' and Y. Let 

X = U + V (1-1) 

Y = p/(t/) + W, where (1.2) 

U, V, and W are unobservable independent random variables with£V=£W = 0. In addition p is an unknown 
constant and /is a given function. We suppose that EX 2 and EY 2 are finite. This is known as the structural 
form of the errors in variables problem. Since there is a great deal of confusion in the literature between the 
case when U is a random variable and when U is not (the functional case), only the structural case is dealt 
with directly. Parallel results for the functional case can be obtained in a straightforward manner. These 
results will, however, restrict the values that a sequence of constants Uy, . . . , U a can take. 
The least squares estimate of p is, of course, 



3 = 2 YJVd/l, C/(*.)) 2 



(2) 



where the observable random pairs [X t , Y t ) i= 1, . . , n, are independent and have the same joint distribution 
as X and Y, see (1.1) and (1.2). It is well known that when 3^0, and f(X) = X that 



<H 



1 as n -> a* . (3) 



(The least squares estimate is biased toward zero.) 

It is also known (see Kendall and Stuart [3]) that for f(X) = X*, (k= 1, 2, . . .), and X and Y are normal 
that result (3) holds. However until now general conditions under which (3) holds were not available. We 
give sufficient conditions under which either (3) (Theorem 1) or the opposite result (Theorem 2) 



<H 



n -> co holds. (4) 



While the author has not found these two results in the literature he believes that they may be well known 
by somebody. 

• Center for Applied Mathematics, National Measurement Laboratory. 

■ Figure* in brackets indicate literature references at the end of this paper. 
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THEOREM 1. If({Xf is convex and not constant a.s. then (3) holds for p ^ 0. 

Proof: 

P s/(t/,.)/(x,.) i zw/jx) _ s/^j/gq 

P 2/(^) 2 p S/fj:) 3 Z/M 2 + U " {n } 

Since /^(X) is convex it follows by Jensen's inequality that 



(5) 



E/(X) 2 ^ E(f(E[X\U])) 2 = W) 2 (6) 

From another application of Jensen's inequality we have 

Ef(xy > E(E\J[X)\U}) 2 (7) 

In addition notice that 

mu)f(X) = E(f(V)E[f(X)\U)] 
^ E(\f(U)\\E[f(X)\U]\) * (E(f(Vy)E(E[f(X)\U}) 2 y 2 

by the Cauchy-Swarlz inequality. Therefore by (6) and (7) 

(Ef(xyy * (E/{u)f(x)y 

which implies that 

Ef(xy > Ef(U)f(X) 

The theorem now follows by applying the strong law of large numbers to the terms ■■■■ ' n and 

n m (4). 

If/ 2 is not convex a positive p may be overestimated. Theorem 2 provides the necessary support for this 
statement. 

THEOREM 2.1fi has two continuous derivatives and satisfies 

^>M ,8, 

for 2 in some interval I then there exists distributions for U, V, and W suck that (4) holds. 

PROOF: Following statements in the proof of Theorem 1 it is sufficient to show that there exists distributions 
for U, V, and W such that 

Ef(uy(x) > Ef(xy. 

Since /has two continuous derivatives it follows that 

f(X) =f(U) +/'(t/)v+/"(e,)y 

f(xy =f(uy + 2f(uy(u)v + [ne^e^ + (/-(e^nv 2 

where 0j and 2 are points between U and V. 
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Take Uq to be in /. Let *l be any point such that 

inf 11MM _ f&y^ > {f\ Z2 ))\ (9) 

Such a point T) exists because f is continuous, and {8) holds by hypothesis. Take V to have a two point 
distribution 

P(V = i\) = P(V=—t\) = 1/2. 

Then 

E{f{a,)f{X) - f(X)) 

= /M\f(uo) + Ef"(e,) jj - [/W + ~£(Ae 2 )/(e 2 ) + /'(e 2 ) 2 )l > o . 

Finally we note that, since this last inequality is strict, U may have uniform distribution in a narrow interval 
around the chosen point U . 

EXAMPLE 1: Let /(A) = X a , X > 0. Then, if a 2* 1/2 the conditions of Theorem 1 are satisfied. On the 
other hand if a < 1/3, the conditions of Theorem 2 are satisfied. This example is important for NBS standards 
work for concrete strength, see [4], (However, the functional case is appropriate.) It is also important for 
background characterization in x-ray spectroscopy. 

EXAMPLE 2: The conditions of theorem 2 are not necessary. lf/(X) satisfies condition (8) for Z a rational 
number in the unit interval and arbitrary elsewhere, then the proof of Theorem 2 can be used to construct 
distributions such that (4) holds. 

COMMENT 1: The conditions of Theorems 1 and 2 can be used to check parameters are estimated in the 
new model. 

X as in 1.1 (1.3) 

Y' = Zajhj(U) + W (for example h j {x)=x i ~ Y ) 

by considering orthogonalized fy's, see Ferguson [5]. Since {X — a) 2 is always convex Theorem 1 holds for 
the slope in the linear case when a constant term is in the model (1.1), (1.3). 

Comment 2: While the results given here directly relate to the property of being biased toward or away 
from the origin they do not relate to attenuation of slope. Attenuation requires the extra condition that 



<H 



1 as n ■ 



To see that this extra condition may fail in Theorem 1 take 

/(*) = x 2 — 2cx, where c is a fixed positive constant, 
P(0 =£ U =S c ) = 1, and P(\V\ S* 4c) = 1. 
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